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ABSTRACT 

Contamination from instrumental effects interacting with bright astrophysical sources is the primary 
impediment to measuring Epoch of Rcionization and BAO 21 cm power spectra an effect called mode- 
mixing. In this paper we identify four fundamental power spectrum shapes produced by mode-mixing 
that will affect all upcoming observations. We are able, for the first time, to explain the wedge-like 
structure seen in advanced simulations and to forecast the shape of an 'EoR window' that is mostly 
free of contamination. Understanding the origins of these contaminations also enables us to identify 
calibration and foreground subtraction errors below the imaging limit, providing a powerful new tool 
for precision observations. 



1. INTRODUCTION 

Observations of redshifted 21 cm emission from neu- 
tral hydrogen have the potential to reveal the process 
of reionization and provide an important new tool for 
observational cosmology (see Furlanetto et al. 2006 and 
Morales & Wyithe 2010 for recent reviews). A number 
of experiments are currently under construction to ob- 
serve the 21 cm power spectrum, including LOFAR (LOw 
Frequency ARray^), PAPER (Precision Array for Prob- 
ing the Epoch of Reionization"^), the MWA (Murchison 
Widefield Array''), and CHIME (Canadian Hydrogen In- 
tensity Mapping Experiment"''). The primary challenge 
for these observations is removing contamination from 
the bright astrophysical foregrounds interacting with in- 
strumental effects and foreground mis-subtractions. 

The astrophysical foregrounds are up to five orders of 
magnitude stronger than the EoR signal, and initial stud- 
ies of the 21 cm foreground concentrated on identifying 
all of the potentially offending sources (e.g., Matteo et al. 
2002; Oh & Mack 2003; Matteo et al. 2004; Gnedin & 
Shaver 2004; Santos et al. 2005; McQuinn et al. 2007; re- 
viewed by Morales & Wyithe 2010). The general consen- 
sus is that all known astrophysical foregrounds are either 
spectrally smooth or at known editable frequencies (e.g., 
galactic radio recombination lines), and none of them 
mimics the spherical symmetry of the EoR's redshifted 
emission line. The spectrally smooth astrophysical fore- 
ground emission dominates at small k\\ (line-of-sight wave 
numbers) , but quickly falls below the EoR signal strength 
at higher k\\ values. Conceptually, purely angular modes 
are dominated by the foregrounds, but the EoR signal 
can be observed in the line-of-sight (frequency) modes 
(Zaldarriaga et al. 2004; Morales & Hewitt 2004; Jelic 
et al. 2008; Wang et al. 2006; Barker et al. 2009). 

The difficulty is that no instrument is perfect, and 
small instrumental and observational effects can throw 
the strong angular foregrounds into the frequency (fcy) 
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direction. Effects include chromatic side lobes from 
sources producing line-of-sight ripples (Bowman et al. 
2009; Liu et al. 2009), small calibration errors leading 
to mis-subtraction of the chromatic PSFs (Datta et al. 
2010), Faraday rotation of the galactic synchrotron beat- 
ing with polarization mis-calibrations (Gcil et al. 2011), 
chromatic primary beams, and numerous other effects. 
Collectively these are called mode-mixing foregrounds, 
and have been the primary focus of the EoR foreground 
subtraction community for the past few years. 

Precision simulations by Datta et al. (2010) have 
shown a very distinctive wedge shape in the fcy vs. 
fc_L power spectrum for three types of mode-mixing 
contamination — small amplitude errors in bright source 
removal, small position errors in the bright source re- 
moval, and small calibration errors. However, the origin 
of the wedge shape is not clear. In this paper we expand 
on the work by Liu & Tegmark (2011) and Vedantham 
et al. (2011) to develop a mathematical framework for 
subtractive foreground removal and mode-mixing con- 
tamination in §2. In §3 & §4 we then explore four mode- 
mixing signatures and develop a quantitative and qual- 
itative imderstanding of their origin and why they are 
fundamental to the measurement process. We conclude 
in §5 by discussing how our new understanding will allow 
us to calibrate and identify foreground errors below the 
imaging limit. 

As the first installment in an informal series of papers 
on mode-mixing, this paper concentrates on fundamen- 
tal contamination common to all observations with sub- 
sequent papers exploring array-layout dependent effects 
and new statistical methods for mitigating these instru- 
mental contaminations. 

2. FRAMEWORK 

Starting with the notation of Liu & Tegmark (2011) 
the band powers from the measured data (m) can be 
determined with a simplified version of their Equation 2 

Pa = m^E„m, (1) 

where is the power spectrum analysis and foreground 
subtraction, we have used the fact that interferometric 
data is naturally zero mean, and we have omitted the 
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foreground bias term (in essence it is the bias term we 
are interested in calculating). For this paper it will be 
easier to split the matrix operator into two symmetric 
halves 

E„ = D^D„, (2) 

each half describing the linear analysis and foreground 
subtraction pipeline. The processed data for each band 
m'Cka) (the k-space pixels for which |k| falls in the spher- 
ical shell of the band a) is then given by 

m'(ka) = Da(ka, v; a)m(v) (3) 

and the band power by 

p„ - m'^m'„ = (m^D;^)(D„m). (4) 

Throughout we will use the operator notation A(a, b; x) 
to signify a transformation from coordinate vector b to 
coordinate vector a that depends on parameters x, so 
the operator in Equation 3 is interpreted as transforming 
the data from the raw visibilities v to the A;-space pixels 
within a band ko,, which depends on the band a we are 
interested in. A simplified version of a typical analysis 
pipeline would be 

D„(k„,v) = S(k,,k) FiD(k,u) B%,v) (5) 

where the visibility data is griddcd (B ) to the uv plane 
as a function of frequency (uv vs. frequency coordinates 
denoted by u), the frequency direction is Fourier trans- 
formed along the line of sight (frequency) direction and 
the coordinates are mapped to cosmological wavenum- 
bers k, and the wavenumbers associated with the band 
power in question arc selected (S). 

While most astrophysical foregrounds arc smooth in 
frequency and thus separable from the nearly spheri- 
cal EoR signal in principle (all emission near fcy = 0, 
Morales et al. 2006; McQuinn et al. 2006), the effect 
of chromatic instrument responses, imperfect foreground 
models, and/or imperfect calibration conspire to throw 
this nearly purely angular foreground power into the line- 
of-sight fc|| direction, masking the faint EoR signal. Here 
we label the power spectra due to various mode mixing 
terms as pe (k) , which is defined as the square of the data 
residuals r(k) in fc-spacc (po = |rp). We can formally 
calculate the shapes of these mode-mixing power spectra 
by considering the difference between the observed and 
subtracted foregrounds. Looking at the residuals for a 
simplified software holography chain we have 

r(k) = FiD(k,u)B^(u,v)M(v,u)F(u,(?)/(0) 
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The first line of Equation 6 shows the observed fore- 
ground as the true foreground / measured by the instru- 

ment M and run through a holographic analysis FB . 
The second line is the subtracted foreground, with the 
model foreground / and model instrument B replac- 
ing the true foreground and instrument. The analysis 
portion of how visibilities are transformed into fc-space 
measurements are usually the same for both the mea- 
surement and the model (they can differ for computa- 
tional reasons), so we can introduce an analysis operator 

A(k,v) = FB^. 

In general the residual foreground contamination is due 
to an admixture of model errors (/ I) and calibration 
errors (M B), but it is instructive to look at these 
sources of errors independently 

rM = AB(/ — j) model error, calibration correct (7) 
rc = A(M — B)/ calibration error, model correct. (8) 

Formally our work is now done. We can parametrize the 
kinds of errors we make, e.g. amplitude and or position 
model errors in Equation 7, and calculate the associated 
power spectra Pe(k) for our instrument. 

However, it is useful to qualitatively identify the origin 
of these shapes. In §3 & 4 we identify four fundamental 
residual power spectrum shapes that will be seen by all 
upcoming instruments and describe their origins. In this 
paper we concentrate specifically on the contamination 
that will be seen by any upcoming observation, indepen- 
dent of the array layout. In subsequent papers we explore 
the effects of array layout and non-uniform sampling of 
the uv plane (Hazclton et al. in preparation), and more 
advanced statistical techniques for removing the instru- 
mental effects. For this paper we limit ourselves to effects 
related to fundamental information loss on the antenna 
scale. 

3. MODEL ERROR SHAPE AND THE 
FOREGROUND WEDGE 

First we consider the effect of foreground model errors. 
In this case we assume the calibration of the instrument 
is perfect both the complex gain and beam shape of 
each antenna is known exactly but the subtracted fore- 
ground model is not equal to the true sky (Equation 7). 
Typical model errors include mistakes in the amplitude 
and location of compact sources, errors in the amplitude 
and shape of diffuse emission, and polarization errors for 
both compact and diffuse sources. Both thermal noise 
and systematics contribute to foreground model errors, 
and the PSF sidelobes often make it impossible to disen- 
tangle the true brightness and locations of the sources in 
the field. Accurately determining the foregrounds is par- 
ticularly difficult in the confusion limit, and Liu et al. 
(2009) and Bowman et al. (2009) have done excellent 
work on improving the accuracy of the foreground mod- 
els in the confusion limit. 

In Figure 10 of Datta et al. (2010) random errors in the 
foreground model generated a very distinctive 2D fc-space 
"wedge" which has now been seen in observations with 
the MWA prototype (Bernardi, et al. in preparation). As 
we will explain in this section, this wedge is due to the 
chromatic instrument response and inherent information 
loss. 



Any interferometric measurement suffers information 
loss on the scale of one antenna. An antenna sums the 
electric field across its surface into a single voltage signal 
which is cross-correlated with the integrated electric field 
from a second antenna to form a visibility. In this process 
the variations in the electric field correlation across the 
antenna surface (e.g. from a source far from field center) 
are lost. Equivalently, in the uv plane one visibility is 
formed by integrating the true uv correlation with the 
beam pattern of that antenna pair B(v, u) over a small 
frequency channel as shown in Equation 6. Modern ar- 
ray simulators use this integral of the uv plane to predict 
the visibilities including direction dependent gain pat- 
terns. (For a conceptual introduction to interferometric 
measurement, please see Chapter 3 of Morales & Wyithe 
2010). The first step of any analysis is to reconstruct an 
estimate of the uv plane j(u) by gridding the visibilities 

(B in Equation 6). However, B is not invertible and the 
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the operation B B does not recreate the true uv plane 
distribution of the sky. 

Various statistical priors can be used to mitigate this 
information loss, and deconvolution is a powerful tech- 
nique for reconstructing the contributions to each visi- 
bility below the antenna scale. Effectively the statistical 
prior that most of the flux is due to a list of sources (or 
CLEAN components) allows their contribution to each 
visibility to be determined accurately despite the smear- 
ing due to the antenna integration in the uv plane. 

However, the EoR signal is below the imaging noise 
floor for the first generation observatories (S/N less than 
one per per visibility, or equivalently, phase noise greater 
than one radian). Thus deconvolving the EoR signal is 
not possible, and we must inherently work with a 'dirty' 
residual map (this is also true for all CMB measurements 
and other diffuse power spectrum measurements). Fore- 
ground model errors appear as small spurious signals in 
the EoR dirty map. First we will explore the effect of am- 
plitude errors which exhibit the basic features common 
to several of our model and calibration error signatures, 
then we will describe source locations errors in the sub- 
sequent sub-section. 

3.1. Amplitude errors 

An error in source brightness will leave a small resid- 
ual positive or negative source at the location of the true 
source, and is conceptually the simplest of the contami- 
nations. Figure 1 shows the true correlation of a fiat spec- 
trum source far from the field center in the u, / space (or 
equivalently fcj, and line-of-sight Mpc) with a few base- 
lines overlaid. The oscillation in k± is given by the dis- 
tance from the field center (quick oscillations for sources 
near field edge) and for a fiat spectrum source amplitude 
is constant in frequency. The baselines all migrate to 
larger distances (in wavelengths) with higher frequency, 
with the slope of the chromatic migration increasing with 
baseline length. 

Figure 2 then zooms in on a small region of Figure 1 
to explore the effects of measurement (lefthand panel) 
and reconstruction (righthand panel) for a single base- 
line. In the left hand panel the true correlation for the 
residual source (vertical corrugations) is integrated by 
the antenna beam (grey rectangles) to form the residual 
visibility. The measured visibility traces the true corre- 
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Fig. 1. — This cartoon represents a slice through the three di- 
mensional u, / space, where u is proportional to fc^ and / is pro- 
portional to line-of-sight distance in Mpc. The vertical corrugation 
shows the spatial-frequency corrugation of a fiat spectrum source 
away from the center of the field (real part shown). The diag- 
onal stripes show the paths of individual antenna baselines as a 
function of frequency. While the distance between two antennas 
in wavelengths (or k±) increases with frequency for all baselines, 
the mis-alignment angle is larger for longer baselines. It is this 
increasing mis-alignment of the baselines with the corrugations of 
a residual source that create the foreground wedge. 



lation as shown by the size of the hexagons and the line 
width. (Mathematically this is a sampled convolution 
in u of the beam and the true correlation — or the true 
correlation times the direction dependent gain of that 
antenna pair.) As expected, the measured visibility os- 
cillates as a function of frequency due to the chromatic 
increase in baseline length. 

The difficulty comes in the reconstruction of an unbi- 
ased estimate of the sky. As previously mentioned we 
have lost information both where there are no baselines 
(missing spatial frequencies) and on scales smaller than 
the antenna in the process of measuring the signal. Be- 
cause the signal to noise on the EoR signal is much less 
than one for a single baseline, we must resort to grid- 
ding the visibilities to form a dirty map. Gridding with 
the holographic antenna pattern guarantees an unbiased 
power spectrum measurement (Morales & Matejek 2009; 
Tegmark 1997), but the exact form of the gridding is not 
important for this discussion. 

The righthand panel of Figure 2 shows the effect of 
gridding for one baseline. The sinusoidal variations in the 
visibility are spread out by the gridding kernel (boxes), 
resulting in regions of high and low amplitude repre- 
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Fig. 2. — These figures zoom in two a small region of the u, / piano to study the efi'ect of baseline chromaticity. In the left hand panel, the 
vertical corrugations from a residual source are shown, with a line showing the measured visibility. The measured visibility is the integral of 
the true correlation function (grey corrugations) times the antenna beam (grey boxes in left hand figure). The resulting measured visibility 
wraps as a function of frequency, as indicated by the line width and the size of the hexagons. The right hand figure then shows the result 
of the gridding process, where the measured visibilities arc gridded to the reconstructed u, / plane. Effectively the measured visibility is 
smeared over a region of u by the gridding kernel. Light grey indicates a small (real component) reconstructed in that region of u, /, while 
dark grey indicates a large value. When we Fourier transform in the line-of-sight direction to form the power spectrum cube (Figure 4), 
we will see oscillations in the line-of-sight direction as indicated by the oscillating grey values along the vertical dashed line. 



sentcd by the grey scale. This produces oscillations in 
the line-of-sight direction as seen by the varying ampli- 
tude along the dashed line. This oscillation is not in the 
original source (lefthand panel), but because we lost in- 
formation on the scale of the antenna unavoidably shows 
up in our reconstructed signal (right hand panel). When 
we Fourier tranform in the line-of-sight direction to get 
into the three dimensional k-space this reconstruction 
error throws power that was originally purely in the fcx 
direction into the fcy dimension — the hallmark of mode- 
mixing. 

Figure 3 shows a precision simulation similar to the 
simulations by Datta et al. (2010) of a single mis- 
subtracted off-axis point source as seen by a simplified 
array. In addition to the phase wrap predicted in Fig- 
ure 2b, the longer baselines produce higher frequency 
contamination due to their higher chromatic migration 
angles. Mathematically, the wavelength of the residual 
source in the u direction Ax is related to the wavelength 
of the contamination in the line-of-sight AloS direction by 
the baseline to wavelength conversion u(A) = u(to)//c, 
giving the relationship 
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(9) 



Fourier transforming in the line-of-sight direction and 
converting to cosmological coordinates (Morales & He- 
witt 2004) one obtains, after a little algebra. 



k|| w ^'(rad) kx 



Dm(z) E(z) 
(1 + z) 



(10) 



9' (rad) is the vector distance of the residual source from 
the field center, fcx is the baseline length in cosmological 
coordinates, and the cosmological terms in parenthesis 
range from 0.63 at a redshift of 1 to 3.4 at a redshift of 8 
(r^A = 0.728, h = 0.704, flat universe following notation 
of Hogg 1999). 

The contamination in Equation 10 is remarkably sim- 
ple: a power law of 1 relationship between fcx and kn, 
depending only on the angle of the source from the field 
center and an order one constant based on the cosmol- 
ogy. Figure 4 shows how this contamination will appear 
in the fcx vs. k\\ space. This is an approximate relation- 
ship, as only a small length of the line-of-sight oscilla- 
tion is contributed by one baseline, and the contributions 
of different baselines (at same fcx) are incoherent. The 
sum of these short incoherent oscillations at the same 
wavelength is a strong peak at the associated line-of- 
sight wavenumber fcy, but will contain power at other 
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Fig. 3. — This figure represents a uf slice through a precision 
simulation of a simple array, and illustrates the contamination de- 
scribed in Figures 1—4. On the right half of the figure (positive u) 
there is a single baseline in a reconstructed uf plane (analogous 
to Figure 2b), and on the left side there are numerous baselines 
of different lengths. The color scale shows the reconstructed phase 
for a single residual source 14° from the field center (black = — tt, 
white = -|-7r, grey = phase or no baseline contribution). The 
phase wrap is clearly seen with increasing frequency in the single 
baseline on the right, and the increasing rate of phase wrap is seen 
in the longer baselines on the left. 

wavelengths and the magnitude of the mode-mixing wiU 
decrease as more baselines are added (smoother PSF). 

There is, however, a soft limit to the line-of-sight con- 
tamination imposed by the instrument field-of-view. The 
corrugations of the residual sources are band limited by 
the instrument field-of-view, giving a maximum line-of- 
sight contamination of 

k||Max«iFoV(radians)fc^(^^^^^). (11) 

This identifies a convenient EoR window above k|| Max 
where foreground model errors will not contaminate the 
EoR signal (after Vedantham et al. 2011). As we will see 
in later sections, the k|| line plays an important role 
in determining the region where the EoR signal can be 
measured and the relative importance of different con- 
taminations. 

The small residual sources from amplitude errors have 
a constant magnitude as a function of k± (FT of a 6- 
function) , so the amplitude along any dashed line in Fig- 
ure 4 is constant. However, different lines in the wedge 
correspond to sources at different (projected) distances 
from the phase center so the amplitude can vary per- 
pendicular to the k|| Max line. The characteristic shape 
of amplitude errors is a wedge of power below ky Max 
that is constant parallel to the fc|| cx kj_ diagonal but 
varies in the orthogonal fcy cx —k± direction, with errors 
closer to the field edge appearing higher in the wedge. 
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Fig. 4. — This figure shows a slice through the fcy vs. kj_ space 
on logarithmic coordinates. The residual contamination described 
in Figures 1 & 2 appears as a line of contamination along fcy oc kj_, 
with the intercept depending on distance from the phase center. 
The instrumental field-of-view provides a soft cutoff in the contam- 
ination, leaving an EoR window that is free of foreground model 
errors. The amplitude of the contamination depends on the PSF 
of the instrument (smoother PSF leads to lower contamination), 
but in general it will be difficult to remove foreground contami- 
nation from within the mode-mixing wedge. For amplitude errors 
the contamination is uniform in amplitude along lines of fcy cx kj_, 
while the contamination from location errors increases ^linearly 
with 



In the simulations by Datta et al. (2010), mis-subtracted 
sources were scattered uniformly across the image, cre- 
ating many such diagonal lines for sources at different 
projected distances from the image center. These many 
diagonal lines in the power spectrum added together to 
produce the wedge seen in that work. 

WHY DON'T WE JUST... 

The origin of the foreground wedge is the inherent in- 
formation loss below the antenna scale, coupled with our 
inability to deconvolve when the signal-to-noise is less 
than one. However, this is not at all self apparent and it 
is natural to suggest a number of alternatives. Common 
ideas include: 

Why don't we just grid in meters instead of wave- 
lengths? Gridding in meters skews Figures 1 & 2 so the 
baselines are vertical but the corrugations of the residual 
source and the line-of-sight Fourier transform are at an 
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angle (which increases with baseline length). The mode- 
mixing is unchanged by this coordinate transformation — 
the same effect occurs, just in a skewed coordinate frame. 
While gridding in meters may be more computationally 
efficient, the effect described here is independent of the 
gridding coordinates. 

Why don't we Direct Fourier Transform the visibilities 
without gridding ? This seems like the obvious solution as 
it eliminates the gridding step entirely, but there are two 
reasons this does not solve our problem. First, the an- 
tenna field-of-view changes significantly with frequency 
(even if all the antennas are identical to each other), and 
we must be able to insert this into our analysis. One 
way of doing this is to form maps at different frequen- 
cies via DFT, which are then tapered with the primary 
beam before assembling into an image cube (this is what 
NVSS did to assemble mosaics). However, multiplying 
by an image space taper is identical to convolving in the 
uv plane. This smooths out oiiv visibilities in the uv 
plane, reproducing the effect seen in the righthand panel 
of Figure 2. Gridding with the antenna beam is identical 
to tapering in the image (this duality is the basis of soft- 
ware holography). Any image tapering reproduces this 
effect. 

One can consider performing a full sky DFT and not 
applying a taper (as one might be tempted to do if the an- 
tenna beam was frequency independent), but this turns 
out to be worse for a power spectrum analysis. Effec- 
tively, contributions from a visibility are reconstructed 
across the full sky, even in areas where the telescope had 
no sensitivity (e.g. far outside the primary beam). This 
introduces bias into the power spectrum measurement 
(Tegmark 1997). Fundamentally we know the contribu- 
tions to a single visibility came from a region of the uv 
plane, not a point, and we need to reconstruct this effect. 
We can do this either by gridding in uv or tapering the 
image, but not doing either actually biases our result by 
over emphasizing areas of the sky. 

What if we only include visibilities which exactly over- 
lap in u when performing the line-of-sight FT? The first 
problem is that if the antenna field-of-view changes, the 
integral of the uv plane that went into making those vis- 
ibilities differs. This reproduces the difficulty discussed 
in the previous question and tapering/gridding is needed 
to calibrate the data. Even if the antennas are not chro- 
matic, this represents a drastic decrease in the effective 
collecting area. One cannot design a two dimensional 
antenna layout that provides the necessary uv coverage 
without discarding the majority of the baselines (even a 
regular grid of antennas only has good redundancy along 
the two grid directions). 

What about deconvolving the EoR signal itself! This 
could be done for the SKA or other instruments which 
will image the EoR and BAO fluctuations. However, 
deconvolution requires a signal to noise per visibility 
greater than one (phase noise < one radian) and the first 
generations of instruments will not have the sensitivity. 

What if we measured every baseline (at the scale of X/2 
or better)? This would eliminate the contamination we 
have identified, not by removing the wedge but by driv- 
ing its amplitude to zero. With dense enough baseline 
coverage the incoherent summing of many line-of-sight 
oscillations in the righthand panel of Figure 2 would push 
the amplitude to zero. However, this criteria is equiva- 



lent to building an instrument with a (5-function PSF. 
Desirable but impractical. 

We believe that the foreground wedge identified here 
is fundamental, ft is a product of the information loss 
at the antenna scale suffered in making the interferomet- 
ric measurement, and the need to reconstruct a faithful 
'dirty' reconstruction of the sky. 

3.2. Location errors 

Location errors assume that the amplitude of the 
model source is correct, but the location is slightly 
wrong — typically by much less than one beamwidth. In 
the absence of calibration errors, this is equivalent to hav- 
ing two 5-functions with opposite sign very close to one 
another. After transforming to the u, / frame of Figures 
1 & 2, the Fourier shift theorem shows the contamina- 
tion to be the subtraction of two sinusoidal corrugations 
of slightly different frequency. These subtract perfectly 
at zero k± with the amplitude growing with k± as the 
corrugations slowly creep out of phase. In the limit of 
offsets much smaller than one beam (<C one radian of 
phase at longest baseline), this gives a sinusoidal corru- 
gation whose magnitude increases approximately linearly 
with k±. 

We can then repeat the argument of the previous sec- 
tion and Figures 1-4, the only difference being that the 
source of the contamination is a linearly increasing func- 
tion of fc_L instead of a constant. This limits the contam- 
ination to the same region as the amplitude errors, with 
the amplitude of the contamination proportional to 
along lines of A'n oc k±. Thus location errors preserve the 
same EoR window above k|| Max but with a distinctive 
linear increase in amplitude along lines of k\\ oc k±. This 
is the origin of the distinctive wedge shape seen in Figure 
10 of Datta et al. (2010). 

4. CALIBRATION ERROR SHAPES 

In this section we consider the effect of calibration er- 
rors. We now assume that the foreground model is equal 
to the true sky, but that the model instrumental response 
is not equal to the true response (B 7^ M, Equation 8). 
There are a number of ways the instrumental calibration 
can be wrong, from simple gain errors (visibility ampli- 
tude error) to errors in the antenna FoV or gain profile 
(wrong integral of the true uv plane) to assuming all 
antennas are identical (not accounting for antenna-to- 
antcnna variation) to antenna location or survey errors 
(error in location of a visibility) or full fledged direction 
and frequency calibration errors. 

To begin with we will concentrate on small frequency- 
independent errors on a subset of the baselines. Simple 
gain errors and differences between antennas will be re- 
flected in a few visibilities with the wrong complex am- 
plitude. If most of the visibility calibrations are correct 
(and the model is correct), the correct visibilities will 
subtract perfectly leaving only the erroneous component 
of the miscalibrated baselines. These calibration errors 
lead to two residual power spectrum shapes — a diffuse 
component that contaminates nearly all k modes and a 
third wedge shape that is similar to the foreground model 
residuals of §3 but with a distinct functional form. 

Figure 5 shows the effect of this calibration residual, 
where the dotted visibilities have subtracted perfectly 
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Fig. 5. — This cartoon shows the effect of caUbration errors in a 
portion of the u, / (or k± , line-of-sight Mpc) space. The diagonal 
lines show the paths of a number of visibilities (unequally spaced) . 
The dotted visibilities are correctly calibrated and have subtracted 
perfectly, but one visibility had a small calibration error so left 
a residual visibility (solid line). This residual visibility is gridded 
to the u, f plane by the grey shaded region. After Fourier trans- 
forming along the frequency direction (dashed line), this residual 
visibility produces contamination at all fcy modes. Effectively it 
appears as the Fourier transform of a windowed <5-function (see 
text for details). 



but the black visibility has not. An error in the gain 
amplitude leaves a residual visibility that is in phase 
with the underlying foreground signal, while a phase er- 
ror leaves a residual visibility that is ±7r/2 rad out of 
phase. The residual complex visibility is then gridded to 
the u, f plane as shown by the grey region. When tak- 
ing a Fourier transform along the line-of-sight direction 
(indicated by the dashed line) this gain error appears as 
a windowed (5-function. Effectively the calibration error 
for a visibility produces a sharp feature in the frequency 
direction, producing contamination over a wide range of 
fc|| modes. 

The contamination in fcy vs. k± space appears as a flat 
contamination at all fcy ((5-function FT) times a window 
function that falls with increasing fcy , as shown in Figure 
6. If the visibilities are gridded with the beam pattern 
B(v, u), the windowing function is a scaled version of the 
antenna beam pattern — the dashed line in Figure 5 tra- 
verses the uv beam pattern and is Fourier transformed 
to produce a scaled version of the angular beam pattern 
in the /cy direction. This is the most insidious of the fore- 
grounds detailed in this paper because the contamination 
extends into the 'EoR window.' 



o 




>- 

log k± 



Fig. 6. — This cartoon shows the effect of calibration errors in 
the fc|| vs. fc^ space. The first component reaches to high fcy , 
contaminating the EoR window. This contamination is windowed 
(represented by grey scale) , and the effectiveness of this window- 
ing depends on the details of the analysis. There is also a wedge 
component due to calibration errors that is similar to the two fore- 
ground wedges, but depends on the total baseline error with a ~ kj_ 
functional form. 



The calibration errors also produce another wedge 
shape. The visibilities with calibration errors see sources 
from across the sky and oscillate with frequency just like 
in Figure 2. This again produces the same kind of k 
space contamination, but because the sky is filled with 
sources it naturally contains sources from all distances 
from phase center up to the field-of-view cutoff. The dif- 
ference is that the contamination is only over the range 
of k± (or u) impacted by that baseline. For the ran- 
dom calibration errors on all baselines in Figure 11 of 
Datta et al. (2010), this produces a wedge of contamina- 
tion below k| I Max with a functional form that follows the 
baseline distribution. Unlike the the foreground model 
errors which are functions of the diagonal fcy — k± and 
fc|| = —kj_ axes, the calibration error is a function of the 
baseline distribution and is ^parallel to k±. 

The two simple shapes presented here are for frequency 
independent calibration errors. Frequency dependent 
calibration errors produce residuals that oscillate along 
the visibilities' lengths in / and map almost directly into 
the fcy direction. It is hoped that many of these calibra- 
tion errors will be slow and map to small fcy , but they will 
generate additional foreground components that are not 
qualitatively described here. It is, however, straightfor- 
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ward to calculate their shapes. The kinds of errors that 
appear in the calibration B can be parametrized, and for 
each kind of error the residual power spectrum p(k) can 
be explicitly calculated using Equation 8. This will be 
an important step for all 21 cm power spectrum mea- 
surements, but also is quite instrument dependent. Here 
wc have concentrated on four power spectrum residuals 
that will be important for all upcoming observations. 

5. DISCUSSION 

Using the results of §2-§4 we can quantitatively and 
qualitatively understand the origin of some of the dif- 
ferent wedge-shaped structures seen in advanced simu- 
lations. This understanding also allows us to invert the 
process and associate observed power spectrum shapes 
with specific calibration and foreground subtraction er- 
rors. In practice, we feel it is this ability to identify the 
origin of observed power spectrum contaminations that 
will be the most influential result of this work. 

One of the key problems facing EoR observations is 
identifying calibration and foreground subtraction errors 
below the imaging limit. When you arc working several 
orders of magnitude below the confusion limit, different 
subtraction or calibration algorithms can produce nearly 
identical images with very different power spectra. Fur- 
ther, when looking at measured power spectra in fcy vs. 
kj_, to date it has been impossible to identify the causes 
of observed contamination. Using the developments in 
this article, not only can we compare the performance 
of these analyses, we can identify the specific sources of 
observed contamination. A few examples might include: 

• Two calibration algorithms that both produce good 

images can be compared, and their contribution to 
the power spectrum in the EoR window can be used 



to pick the superior algorithm (§4). 

• Symmetric bands of power as seen in Figure 4 (ac- 
counting for projection), can be used to identify 
higher source subtraction errors near the field edge 
(§3.1). 

• Linearly increasing bands of power can be used to 

identify pointing offsets in particular regions of the 
survey (e.g., far from a calibrator or near the iono- 
spheric equator, §3.2). 

• Systematic errors can be estimated by observing 
the contributions of specific foregrounds and their 
covariance with the EoR signal (§2, Morales et al. 
2006). 

In this paper we have described four types of mode- 
mixing contamination any EoR instrument will see. The 
next paper in this series will explore the array-dependent 
effects and the influence of inhomogeneities in the base- 
line distribution (Hazelton et al. in preparation). Fu- 
ture papers will use these results to identify and quan- 
tify the mode-mixing contamination seen in 32 antenna 
MWA observations, and develop more advanced statisti- 
cal methods to mitigate the contaminations due to cali- 
bration errors and array inhomogeneities. 
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